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HYBRID MULTIPLIERS IMPLEMENTED USING DSP CIRCUITRY AND 
PROGRAMMABLE LOGIC CIRCUITRY 



Background of the Invention 

[0001] The present invention relates to programmable 
5 logic integrated circuitry devices and, more 

particularly, the present invention relates to hybrid 
multipliers implemented in programmable logic 
integrated circuitry devices having integrated DSP 
( "DSP") circuitry. 

10 [0002] Programmable logic devices ("PLDs") are well 
known as shown in, for example, Jefferson et al. U.S. 
Patent No. 6,215,326 and Ngai et al. U.S. Patent No. 
6,407,576. PLD technology is well known for its 
ability to allow one common hardware design (embodied 

15 in an integrated circuit) to be programmed to meet the 
needs of many different applications. PLDs with a 
given hardware design can be manufactured in large 
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quantities at low cost. Each user then programs PLDs 
of that kind to meet that user's particular needs. The 
user does not have to do a custom integrated circuit 
design, with the attendant high cost, delay, and 
5 difficulty of revision if modifications are 
subsequently needed . 

[0003] One of the problems facing users of PLDs is 
the limited logic capacity of any particular PLD. As 
applications become more complex, the logic resources 

10 and their interconnections in a PLD become limiting 

factors in the ability to implement particular designs 
in the PLD. This is at least partially attributable to 
the fact that a relatively large portion of 
programmable logic and interconnections are typically 

15 used to implement signal or data processing tasks 

(e.g., digital signal processing ("DSP") tasks) that 
would otherwise require comparatively few resources if 
implemented using DSP circuitry. 

[0004] In an attempt to help alleviate this problem, 
20 recently, PLDs have been manufactured that include, in 
addition to prograimnable logic components, DSP 
components that implement common DSP tasks. Such 
common DSP tasks include, for example, multiplication. 
However, the multipliers implemented in the DSP 
25 circuitry of PLDs are not optimized for all types of 
multiplication tasks. For example, the multipliers 
embedded in the DSP circuitry of any particular PLD are 
limited to one or more particular sizes (e.g., 9 bits 
by 9 bits, 18 bits by 18 bits, 36 bits by 36 bits, 
30 etc . ) . 

[0005] In order to multiply data having sizes that 
do not conform with one of the available sizes, 
present-day hardware compilers implement such a 
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multiplication operation in a single multiplier that 
would be able to accommodate such a multiplication. 
This results in inefficient use of resources. For 
example, when compiling a 10 bits by 10 bits 
5 multiplication when only 9 bits by 9 bits multipliers 
and 18 bits by 18 bits multipliers are available, the 
18 bits by 18 bits multiplier would be used, thus 
making inefficient use of a larger multiplier. 
[0006] It would therefore be desirable to provide a 
10 more efficient implementation of multipliers embedded 
in the DSP circuitry that is integrated in PLDs. 

Summary of the Invention 

[0007] It is therefore an object of this invention 

15 provide a more efficient implementation of multipliers 
in PLDs having integrated DSP circuitry. 
[0008] A user logic design to hardware application 
is provided. The user logic design to hardware 
application compiles a user logic design into hardware 

20 constructs to be implemented in a PLD. 

[0009] The user logic design to hardware application 
may determine, based on any suitable factors, that 
using a multiplier embedded in the DSP circuitry of a 
target PLD to implement a user logic design multiplier 

25 is inefficient (e.g., because the user logic design 

multiplier is relatively small compared to the size of 
the smallest available DSP multiplier that can 
implement the user logic design multiplier, resulting 
in a waste of DSP circuitry) . However, implementation 

30 of the user logic design multiplier in solely 

programmable logic circuitry would result in a waste of 
too much programmable logic (i.e., given the complexity 
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of implementing multipliers in programmable logic 
circuitry) . 

[0010] Based on this determination, the user logic 
design to hardware application may implement the user 
5 logic design multiplier as a hybrid multiplier making 
use of both DSP circuitry and of programmable logic 
circuitry. In one suitable embodiment of the present 
invention, the user logic design to hardware 
application may select that largest available DSP 

10 multiplier that is smaller than the size of the user 
logic design multiplier to use as one partial product 
in an sum of partial product expansion. 
[0011] More particularly, the user logic design to 
hardware application may implement a user logic design 

15 multiplier by decomposing the user logic design 

multiplier into component multipliers. Each component 
multiplier generates a partial product. The sum of all 
of the partial products generated by the component 
multipliers of a user logic design multiplier is at 

20 least substantially equivalent to the product generated 
by the user logic design multiplier. 
[0012] In one suitable approach, the user logic 
design to hardware application implements at least one 
of the component multipliers using the largest DSP 

25 multiplier that is still smaller than the size of the 
user logic design multiplier. The remaining component 
multipliers are implemented using programmable logic 
circuitry. The component multipliers are arranged in a 
circuit that suitably generates the sum of the partial 

30 products generated by the component multipliers. For 
example, the input data of the user logic design 
multiplier is partitioned and separated into 
appropriate strings of bits for use as inputs into the 
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component multipliers. Appropriate shifting and 
padding of the products of the component multipliers 
may be implemented using any suitable arrangement of 
circuitry. An adder may be used to combine the partial 
5 products in order to generate the sum of partial 

products (i.e., the products of the user logic design 
multiplier) . 

Brief Description of the Drawings 
10 [0013] The above and other objects of the present 
invention will be apparent upon consideration of the 
following detailed description, taken in conjunction 

• with the accompanying drawings, in which like reference 
characters refer to like parts throughout, and in 

15 which: 

[0014] FIG. 1 is a block diagram of an illustrative 

• PLD having DSP circuitry in accordance with one 
embodiment of the present invention; 

[0015] FIG. 2 is a block diagram of the illustrative 
20 DSP circuitry of FIG. 1 in accordance with one 
embodiment of the present invention; 

[0016] FIG. 3 is a representation of a user logic 
design multiplier decomposed into component multipliers 
used for obtaining partial products in accordance with 

25 one embodiment of the present invention; 

[0017] FIG. 4 is a generalized representation of a 

user logic design multiplier decomposed into three 
component multipliers used for obtaining partial 
products in accordance with one embodiment of the 

30 present invention; 

[0018] FIG. 5 is a generalized representation of a 
user logic design multiplier decomposed into four 
component multipliers used for obtaining partial 
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products in accordance with one embodiment of the 
present invention; 

[0019] FIG- 6 is a schematic diagram of a particular 
implementation of a user logic design multiplier in 
5 accordance with one embodiment of the present 
invention; 

[0020] FIG. 7 is a more detailed schematic diagram 
of the particular implementation of the user logic 
design multiplier of FIG. 6 in accordance with one 

10 embodiment of the present invention; 

[0021] FIG. 8 is a schematic diagram of a particular 

special case implementation of a user logic design 
multiplier in accordance with one embodiment of the 
present invention; 

15 [0022] FIG. 9 is a flow-chart of illustrative steps 
involved in implementing user logic design multipliers 
as hybrid multipliers in a target PLD in accordance 
with one embodiment of the present invention; 
[0023] FIG. 10 is a simplified schematic diagram of 

20 an illustrative system employing a PLD in accordance 
with one embodiment of the present invention; and 
[0024] FIG. 11 is a schematic diagram of an 
illustrative machine-readable medium that may be 
encoded with machine-readable instructions for use in 

25 implementing a user logic design in a PLD. 

Detailed Description of the Invention 
[0025] Regions in a programmable logic device 
("PLD") may be dedicated to digital signal processing 
30 ("DSP") circuitry. Circuitry for providing DSP 

operations may be concentrated in a region in a PLD 
(e.g., concentrated only in that region). A plurality 

of such regions may be arranged in a PLD. DSP 
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circuitry may be provided to perform multistage DSP 
operations. The organization of the DSP circuitry may 
make that circuitry open for use in many commonly used 
DSP operations. A desired DSP operation of interest 
5 may be selected from a particular set of DSP operations 
that the DSP region is configured to perform. A PLD 
with DSP regions may have other resources (e.g., 
programmable logic regions, memory, etc . ) that are 
integrated with the DSP regions for high speed, low 

10 latency, or quick implementation of applications that 
are currently in high demand, such as video image 
processing, digital communications signal processing, 
or other applications that involve a combination of 
logic operations and DSP operations for suitable 

15 operation . 

[0026] A PLD may have an architecture that is based 
on concentrating circuitry in different regions based 
on purpose. For example, with reference to FIG. 1, PLD 
10 may include programmable logic circuitry 20 (e.g., 

20 programmable logic circuitry 20 that is concentrated in 
block-shaped regions in PLD 10), memory circuitry 30 
(e.g., memory circuitry 30 that is concentrated in 
block-shaped regions in PLD 10) , DSP circuitry 50 
(e.g., DSP circuitry 50 that is concentrated in 

25 block-shaped regions in PLD 10), and distributed 
interconnect circuitry 4 0 . 

[0027] Distributed interconnect circuitry 40 may be 

distributed over PLD 10 to provide connectivity between 
programmable logic circuitry 20, memory circuitry 30, 
30 and DSP circuitry 50. Programmable logic circuitry 20, 
memory circuitry 30, and DSP circuitry 50 may include 
local interconnect resources for forming 
interconnections between local resources . DSP 
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circuitry 50 may be substantially mutually exclusive of 
circuitry, layout, functionality, or any combination 
thereof of programmable logic circuitry 20, memory 
circuitry 30, distributed interconnect circuitry 40, or 
5 any combination thereof. 

[0028] DSP circuitry 50 may be a medium through 
which PLD 10 processes certain types of digital signals 
to achieve suitable operating speeds without 
substantially impairing or using other resources of PLD 

10 10. DSP circuitry 50 may have capabilities that 
complement the capabilities of programmable logic 
circuitry 20. The complementary capabilities may 
sustain substantially concurrent use of substantially 
all of programmable logic circuitry 20 and DSP 

15 circuitry 50. If desired, DSP circuitry 50 may operate 
without relying on distributed interconnect circuitry 
40 for performing internal DSP operations. Thus, DSP 
circuitry 50 may operate as a high speed isolated DSP 
channel . 

20 [0029] DSP circuitry 50 may include multipliers and 
other DSP circuitry. For example, DSP circuitry 50 may 
include DSP circuitry 60 of FIG. 2. With reference now 
to FIG. 2, DSP circuitry 60 may include one or more 
multiplier circuits 62, interconnection circuitry 64, 

25 and one or more DSP stages 66. 

[0030] Multiplier circuits 62 may include a 
plurality of individual multiplier circuits, a 
plurality of multipliers operable in parallel to 
provide a first DSP stage, four parallel n bits by n 

30 bits multipliers that are each configurable to be m 

parallel n/m bits by n/m bits multipliers where n/m is 
an integer, or any other suitable form of multipliers. 
DSP stage (s) 66 may include single stage or multistage 
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DSP circuitry. DSP stage 66 may include adder 
circuitry, subtracter circuitry, accumulator circuitry, 
or any combination thereof. DSP stage 66 and 
multiplier circuits 62 may include other related 
5 circuitry. Interconnection circuitry 64 may include 
' connecting conductors and circuitry that interconnects 
DSP stage 66 with multiplier circuits 62. The 
interconnections may be flexible or dedicated. 
Interconnection circuitry 64 may include conductors and 
10 circuitry for connecting DSP circuitry 62 with other 
resources in a PLD. Interconnection circuitry 64 may 
be dedicated to supporting operations in DSP circuitry 
60. 

[0031] DSP circuitry 60 is illustrative of the 
15 content and organization of a single DSP circuit block. 
Together, multiplier circuits 62 and DSP stage 66 may 
include a combination of processing elements that are 
common to a number of often used DSP operations such as 
infinite impulse response ("IIR") filters, direct form 
20 I finite impulse response ("FIR") filters, direct form 
II FIR filters, multiply accumulate operations, or any 
other suitable DSP operations. 

[0032] A more detailed discussion of the structure 
of a PLD having integrated DSP circuitry is found in 
25 commonly-assigned Langhammer et al. U.S. Patent No. 

6,538,470, which is hereby incorporated by reference 
herein in its entirety. 

[0033] It will be understood that the foregoing 
description of a structure of a PLD having integrated 
30 DSP circuitry is merely illustrative. The present 

invention may be practiced with respect to any other 
suitable PLD structure having integrated DSP circuitry. 
The present invention may, in fact, be practiced with a 
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PLD coupled to external DSP circuitry. For purposes of 
clarity and brevity, the present invention is described 
herein in terms of a PLD having integrated DSP 
circuitry. 

5 [0034] The present invention is described herein 
primarily in terms of a user logic design to hardware 
application that compiles a user logic design into 
hardware (i.e., contained in a PLD). It will be 
understood that the features of the present invention 

10 may be practiced through other ways. For example, the 
features of the present invention may be practiced 
without the aid of an application by allowing the user 
to manually implement design components into PLDs . 
[0035] The user logic design to hardware application 

15 may include any suitable hardware, software, or both 
that is used to compile a user logic design into 
hardware constructs to be implemented in a PLD. The 
user logic design to hardware application may 
automatically implement the components of a user logic 

20 in one or more PLDs. 

[0036] It will be understood that the user logic 
design to hardware application may be configured to 
take into account user preferences with respect to how 
any particular implementation is to take place. For 

25 example, user preferences are taken into account when 
the user logic design to hardware application needs to 
make decisions with respect to how certain components 
are implemented, routed, or the like. The user logic 
design to hardware application may provide the user 

30 with the ability to review and change any 

implementation decisions made by the user logic design 
to hardware application during the compilation process. 
The user logic design to hardware application may take 
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into account other factors, such as the target PLD type 
(e.g., including, for example, the type of integrated 
DSP circuitry, amount of programmable logic, types of 
interconnections, any other suitable information about 
5 the target PLD, or any combination thereof) , 

information about the circuit being implemented (e.g., 
types of components in the user logic design, number of 
particular components in the user logic design, 
application for which the user logic design has been 

10 created (e.g., video processing, network-related, or 
any other suitable application) , size, any other 
suitable information about the circuit being created, 
or any combination thereof) , any other suitable 
factors, or any combination thereof. 

15 [0037] Because multipliers implemented in the DSP 
circuitry are optimized for certain sizes, user logic 
design multipliers of sizes that are different from 
these certain sizes do not efficiently map to what is 
available in the DSP circuitry. For example, in the 

20 Stratix™ family of field-programmable gate array 

products, manufactured by Altera® Corporation of San 
Jose, California, DSP circuitry integrated in the PLD 
contain multipliers optimized for sizes of 9 bits by 9 
bits, 18 bits by 18 bits, and 36 bits by 36 bits. In 

25 order to implement a user logic design multiplier 

having a size of 10 bits by 10 bits, an 18 bits by 18 
bits DSP multiplier would need to be used in which most 
of the capability of the DSP circuitry being used to 
implement the larger 18 bits by 18 bits multiplier is 

30 not being used. 

[0038] Conventionally, when a user logic design 
multiplier exceeds one of the sizes supported by the 
DSP circuitry, the next highest sized DSP multiplier 
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for which the DSP circuitry is optimized must be used. 
In a worst case scenario^ if every multiplier of a user 
logic design exceeds the relative sizes for which the 
DSP circuitry of a PLD is optimized, most of the DSP 
5 circuitry used to implement the DSP multipliers will be 
wasted. 

[0039] Because implementing multipliers using only 
programmable logic circuitry would result in large 
overly-complex implementations that use too many of the 

10 programmable logic circuitry resources^ it would be 
preferable to implement hybrid multipliers in which 
both programmable logic circuitry and DSP circuitry are 
used. Hybrid multipliers are implemented by combining 
the DSP multipliers with additional multiplication 

15 being handled in programmable logic circuitry. 

[0040] This may be accomplished by using the DSP 
multiplier of largest size for which the desired user 
logic design multiplier to be implemented is too large. 
The remaining multiplication may then be processed by 

20 programmable logic circuitry configured accordingly. 
For example, with reference to Altera®' s Stratix™ 
family of PLDs, a 9 bits by 9 bits DSP multiplier may 
be used together with programmable logic circuitry to 
implement a 10 bits by 10 bits user logic design 

25 multiplier. 

[0041] In one embodiment of the present invention, 
the user logic design to hardware application may break 
down a user logic design multiplier into two or more 
other multipliers for implementation in hardware. For 

30 example, a user logic design multiplier that is 
slightly larger than one of the types of DSP 
multipliers embedded in the PLD but is significantly 
smaller than the next largest type of DSP multiplier 
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embedded in the PLD, may be broken down into one or 
more multipliers that may be implemented in one or more 
of the DSP multipliers and into one or more multipliers 
that may be implemented using programmable logic 
5 circuitry. 

[0042] In one suitable approach, this may be 
accomplished by decomposing the larger user logic 
design multiplier into component multipliers. The 
component multipliers may correspond to multipliers 

10 that produce partial products, the sum of which 

produces the product of the original larger user logic 
design multiplier. This decomposition is illustrated 
in FIG. 3, which shows an abstract representation of a 
user logic design multiplier 300. User logic design 

15 multiplier 300 is illustratively shown to be a 10 bits 
by 10 bits multiplier. If the target PLD has, for 
example, only 9 bits by 9 bits DSP multipliers and 18 
bits by 18 bits DSP multipliers, it is understood that 
it would be more efficient to make use of one of the 9 

20 bits by 9 bits DSP multipliers as opposed to one of the 
18 bits by 18 bits DSP multipliers. As shown in 
FIG. 3, multiplier 300 may be decomposed such that one 
of the components of the decomposed multiplier is 9 
bits by 9 bits multiplier 302 that may be efficiently 

25 implemented in one of the DSP multipliers of the target 
PLD. Multiplier 302 is illustrated, without loss of 
generality, as the upper left quadrant of multiplier 
300. The remaining portions of multiplier 300 may be 
decomposed in any suitable way. For* example, in one 

30 suitable approach, the remaining components may be 

decomposed to produce 1 bit by 9 bits multiplier 308, 1 
bit by 1 bits multiplier 306, and 9 bits by 1 bit 
multiplier 304. The user logic design to hardware 
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application may implement multipliers 304, 306, and 308 
in programmable logic circuitry, whereas multiplier 302 
is implemented in a 9 bits by 9 bits DSP multiplier. 
[0043] It will be understood that any suitable 
5 decomposition of multiplier 300 may be made based on 
any suitable factors. For example, if the user logic 
design to hardware application determines that (e.g., 
because of available target resources) a sum of three 
partial products would be more efficient (as opposed to 

10 a sum of four partial products in the above example) , 

the resultant implementation may include multiplier 302 
implemented in a 9 bits by 9 bits DSP multiplier, 9 
bits by 1 bit multiplier 304 implemented in 
programmable logic circuitry, and a 1 bit by 10 bits 

15 multiplier 310 implemented in programmable logic 

circuitry. Any other suitable decomposition may be 
used. For example, a 10 bits by 1 bit multiplier, a 1 
bit by 9 bits multiplier, and a 9 bits by 9 bits 
multiplier may be used. 

20 [0044] FIG. 4 is a generalized diagram of a user 

logic design multiplier 400 having a size of (x+y) bits 
by (s+t) bits. The user logic design to hardware 
application may decompose multiplier 400 into, for 
example, three multipliers 402, 404, and 406 (i.e., 

25 that each generate partial products) where the target 
PLD has DSP multipliers optimized for x bits by s bits 
(i.e., the closest to, without being larger than, (x+y) 
bits by (s+t) bits) . Multiplier 402 may, for example, 
be implemented in a x bits by s bits DSP multiplier. 

30 Multipliers 404 and 406 may be implemented using 
programmable logic circuitry. 

[0045] It will be understood that the decomposition 
of multiplier 400 as shown in FIG. 4 is merely 
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illustrative. Multiplier 400 may be decomposed into 
any suitable component multipliers for generating and 
suitable partial products based on any suitable 
factors, including, for example, user preferences. For 
5 example, FIG. 5 is a generalized diagram of multiplier 
4 00 being decomposed into four component multipliers 
502, 504, 506, and 508 (i.e., that generate partial 
products, the sum of which is the product of user logic 
design multiplier 400) where the target PLD has DSP 

10 multipliers optimized for x bits by s bits (i.e., the 
closest to, without being larger than, (x+y) bits by 
(s+t) bits) where x, y, s, and t represent positive 
integers. Multiplier 502 may, for example, be 
implemented in a x bits by s bits DSP multiplier. 

15 Multipliers 504, 506, and 508 may be implemented using 
programmable logic circuitry. Any such suitable 
decomposition may be made. 

[0046] FIG. 6 shows one implementation of a 10 bits 
•by 10 bits user logic design multiplier as a hybrid 

20 multiplier in hardware. As illustrated in FIG. 6, 
multipliers 600, 602, 604, and 606 represent the 
component multipliers of the user logic design 
multiplier being implemented (e.g., corresponding to 
multipliers 502, 504, 506, and 508 of FIG. 5). 

25 Multiplier 600 is a 9 bits by 9 bits multiplier that 

may be implemented using a DSP multiplier. Multiplier 
602 is a 9 bits by 1 bit multiplier implemented using 
programmable logic circuitry in the target PLD. 
Multiplier 604 is a 1 bit by 9 bits multiplier 

30 implemented using programmable logic circuitry in the 
target PLD. Multiplier 606 is a 1 bit by 1 bit 
multiplier implemented using programmable logic 
circuitry in the target PLD. 
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[0047] The user logic design to hardware application 
combines partial products using appropriate circuitry 
in order to implement the desired multiplier. Partial 
products 632, 634, 636, and 638 are generated from the 
5 multiplication of respective multiplicands 608, 612, 

616, and 620 with respective multipliers 610, 614, 618, 
and 622. The bits of each partial product 632, 634, 
636, and 638 are shifted and padded accordingly, as 
necessary, by respective circuitries 624, 626, 628, and 
10 630. The resultant signals 640, 642, 644, and 646 are 
then added using, for example, adder 648 to produce sum 
of partial products 650 (i.e., the result of the 
original 10 bits by 10 bits user logic design 
multiplier) . 

15 [0048] FIG. 7 is a schematic diagram showing a more 
detailed implementation of the 10 bits by 10 bits user 
logic design multiplier of FIG. 6. For example, 
multipliers 602, 604, 606 that are implemented in 
programmable logic circuitry may be done so using AND 

20 gates 700, 702, and 704, respectively. The products of 
AND gates 700 and 702 may be combined using adder 706. 
The output of adder 706 is appended (i.e., using 
circuitry 710) with the product of AND gate 704 (i.e., 
after appropriate shifting and padding of bits using 

25 circuitry 708). That result is then combined with the 
product of multiplier 600 (i.e., the bits of which have 
been appropriately shifted and padded using circuitry 
712) using adder 648. 

[0049] FIGS. 6 and 7 are merely illustrative 
30 arrangements that may be used to implement a hybrid 
multiplier in accordance with the present invention. 
Any other suitable arrangements may be used. For 
example, different arrangements of components may be 
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used for the sum of four partial products 
implementation. Also, it will be understood that 
although FIGS. 6 and 7 illustrate an implementation of 
a 10 bits by 10 bits multiplier using a sum of four 
5 partial products, any suitable such arrangement may be 
used in an implementation that uses the sum of any 
other suitable umber of partial products (e.g., three). 
[0050] In one embodiment of the present invention, 

the user logic design to hardware application may be 

10 configured to consider the special case of when a user 
logic design multiplier exceeds the optimized sizes of 
DSP multipliers implemented in the target PLD by one 
bit on only one side of the multiplier. For example, 
if the user logic design requires that a 19 bits by 18 

15 bits multiplier be implemented, and the target PLD has 
as the closest match 18 bits by 18 bits DSP 
multipliers, then the user logic design to hardware 
application may implement the user logic design 
multiplier in hardware as illustrated in FIG. 8. 

20 [0051] The arrangement shown in FIG. 8 us;es a sum of 
two partial products to implement a 19 bits by 18 bits 
user logic design multiplier in the target PLD. 
Multiplier 804 is an 18 bits by 18 bits DSP multiplier 
that multiplies 18 bit multiplicand 800 with 18 bit 

25 multiplier 802 to produce partial product 820. The 

second partial product (i.e., partial product 822) is 
generated by a 1 bit by 18 bits multiplier implemented 
using AND gate 814. 1 bit multiplicand 812 is 
multiplied by 18 bit multiplier 810 to produce partial 

30 product 822. The respective bits of partial products 

820 and 822 are shifted and padded, as necessary, using 
appropriate circuitry 806 and 816. The partial 
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products are combined using adder 808 to produce 
multiplier output 818. 

[0052] FIG. 8 is merely an illustrative arrangement. 
Any other suitable arrangement may be used using any 
5 suitable design and combination of components in 

accordance with the present invention. For example, 
the user logic design to hardware application need not 
identify this particular special case of user logic 
design multiplier and may use an implementation such as 

10 that described with respect to FIGS, 7 and 8. 

[0053] The implementation illustrated in FIG. 8 may 
be of particular applicability with respect to FIR 
filters. Symmetrical FIR filters typically require a 
single bit of growth on one of the inputs to the 

15 multiplier as a result of a pre-add operation. For 
example, if a symmetrical FIR filter requires 9 bit 
data and 9 bit coefficients, the pre-add associated 
with symmetry will then require 10 bits by 9 bits 
multipliers. Similarly, if a symmetrical FIR filter 

20 requires 18 bit data and 18 bit coefficients, 19 by 18 
bits multipliers will be needed. 

[0054] It will be understood that although the 

present invention is primarily described herein in 
terms of DSP multipliers being optimized for symmetric 
25 sizes (i.e., n bits by n bits), the present invention 
may be practiced for DSP multipliers optimized for 
asymmetrical sizes (i.e., n bits by m bits, where n ^ 
m) . 

[0055] FIG. 9 is a flow-chart of illustrative steps 
30 involved in implementing user logic design multipliers 
as hybrid multipliers in a target PLD in accordance 
with one embodiment of the present invention. The user 
logic design to hardware application examines a user 
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logic design multiplier 900 and at step 902 determines 
whether any of the DSP multipliers in the target PLD 
may be used in the implementation of the user logic 
design multiplier. If such DSP multipliers do not 
5 exist (e.g., because all are already being used, those 
available are too small, or for any other suitable 
reason) , then the user logic design multiplier may be 
implemented using, for example, solely programmable 
logic circuitry. 

10 [0056] If, at step, 902 the user logic design to 

hardware application determines that there are 
available DSP multiplier resources available, then the 
user logic design to hardware application determines 
which of the available sizes to use. For example, in 

15 one suitable approach, the user logic design to 

hardware application may use an embedded multiplier 
having dimensions closest to that of the user logic 
design multiplier. In another suitable approach, the 
user logic design to hardware application may analyze 

20 any suitable efficiency factors associated with 

implementing a user logic design multiplier in either a 
larger DSP multiplier (i.e., not requiring a sum of 
partial product expansion implementation using the 
hybrid features of the present invention) , or in a 

25 smaller DSP multiplier in accordance with the present 
invention. Such factors may involve comparing the 
value of using the larger multiplier to the savings in 
programmable logic that would be expended in 
implementing the user logic design multiplier in 

30 accordance with the combination of DSP circuitry and 

programmable logic circuitry arrangement in accordance 
with the present invention. 
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[0057] Assuming that an implementation that makes 
use of the combination of DSP circuitry and 
programmable logic circuitry is used, then at step 904, 
the user logic design to hardware application 
5 determines the particular sum of partial product 

expansion to use (e.g., based on the DSP multiplier or 
multipliers that will be used, size of the user logic 
design multiplier, amount of programmable logic 
circuitry to be used, speed of the resultant circuit, 
10 any other suitable factors, or any combination 
thereof) . 

[0058] At step 906, the user logic design to 
hardware application determines the circuit 
implementation of the sum of partial product expansion 

15 identified at step 904. 

[0059] At step 908, the user logic design to 
hardware application implements the user logic design 
multiplier in the target PLD using the circuit 
implementation identified at step 906. 

20 [0060] The flow-chart of FIG. 9 is merely 

illustrative. Any other suitable steps may be used in 
place of or in addition to any of those illustrated in 
accordance with the present invention. Any suitable 
steps may be removed from the flow-chart - 

25 [0061] FIG. 10 illustrates a PLD 10 (FIG. 1) of this 

invention (i.e., having at least one hybrid multiplier 
implemented therein) in a data processing system 1000 
in accordance with one embodiment of the present 
invention. Data processing system 1000 may include one 

30 or more of the following components: a processor 1002; 
memory 1004; I/O circuitry 1006; and peripheral 
devices 1008. These components are coupled together by 
a system bus 1010 and are populated on a circuit 
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board 1012 which is contained in an end-user - 
system 1014. 

[0062] System 1000 may be used in a wide variety of 
applications, such as computer networking, data 
5 networking, instrumentation, video processing, DSP, or 
any other application where the advantage of using 
programmable or reprogrammable logic is desirable. PLD 
10 may be used to perform a variety of different logic 
functions. For example, PLD 10 may be configured as a 

10 processor or controller that works in cooperation with 
processor 1002. PLD 10 may also be used as an arbiter 
for arbitrating access to a shared resource in 
system 1000. In yet another example, PLD 10 may be 
configured as an interface between processor 1002 and 

15 one of the other components in system 1000. 

[0063] FIG. 11 is a schematic diagram of an 
illustrative machine-readable medium that may be 
encoded with machine-readable instructions for use in 
implementing a user logic design in a PLD. Machine- 

20 readable medium 1100 may be encoded with instructions 
for carrying out the features described herein with 
respect to the user logic design to hardware 
application. Machine-readable medium 1100 may be, for 
example, magnetic disks (e.g., floppy disks, hard disk 

25 drives), optical disks (e.g., CD-ROM/R/RW, DVD -R/- 

RW/+R/+RW/RAiyi/ROM) , EPROM, FLASH memory, a PLD, an ASIC 
device, any other suitable medium, or any combination 
thereof. Machine 1102 may be any suitable machine 
capable of reading machine-readable medium 1100, such 

30 as any suitable computer. 

[0064] Thus, a hybrid multiplier implemented using 
DSP circuitry and programmable logic circuitry is 
provided. One skilled in the art will appreciate that 
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the present invention can be practiced by other than 
the described embodiments, which are presented for 
purposes of illustration and not of limitation, and the 
present invention is limited only by the claims which 
5 follow. 



